Reinforcement learning for quasi-passive dynamic walking of an unstable biped robot

نویسندگان

  • Kentarou Hitomi
  • Tomohiro Shibata
  • Yutaka Nakamura
  • Shin Ishii
چکیده

A class of biped locomotion called Passive Dynamic Walking (PDW) has been recognized to be efficient in energy consumption and a key to understand human walking. Although PDW is sensitive to the initial condition and disturbances, studies of Quasi-PDW which incorporates supplemental actuators have been reported to overcome this sensitivity. In this article, we propose a reinforcement learning method designed particularly for Quasi-PDW of a biped robot whose possession of knees makes the system unstable. Simulations show that the learning is quickly accomplished after 1000 episodes, and the obtained controller is robust against variations in the slope gradient and sudden perturbations. c © 2006 Elsevier B.V. All rights reserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

From Passive Dynamic Walking to Passive Turning of Biped walker

Dynamically stable biped robots mimicking human locomotion have received significant attention over the last few decades. Formerly, the existence of stable periodic gaits for straight walking of passive biped walkers was well known and investigated as the notion of passive dynamic walking. This study is aimed to elaborate this notion in the case of three dimensional (3D) walking and extend it f...

متن کامل

Energy Dissipation Rate Control Via a Semi-Analytical Pattern Generation Approach for Planar Three-Legged Galloping Robot based on the Property of Passive Dynamic Walking

In this paper an Energy Dissipation Rate Control (EDRC) method is introduced, which could provide stable walking or running gaits for legged robots. This method is realized by developing a semi-analytical pattern generation approach for a robot during each Single Support Phase (SSP). As yet, several control methods based on passive dynamic walking have been proposed by researchers to provide an...

متن کامل

Dynamic Control Algorithm for Biped Walking Based on Policy Gradient Fuzzy Reinforcement Learning

This paper presents a novel dynamic control approach to acquire biped walking of humanoid robots focussed on policy gradient reinforcement learning with fuzzy evaluative feedback . The proposed structure of controller involves two feedback loops: conventional computed torque controller including impact-force controller and reinforcement learning computed torque controller. Reinforcement learnin...

متن کامل

Episodic Reinforcement Learning Control Approach for Biped Walking

This paper presents a hybrid dynamic control approach to the realisation of humanoid biped robotic walk, focusing on the policy gradient episodic reinforcement learning with fuzzy evaluative feedback. The proposed structure of controller involves two feedback loops: a conventional computed torque controller and an episodic reinforcement learning controller. The reinforcement learning part inclu...

متن کامل

Falling of a Passive Compass-Gait Biped Robot Caused by a Boundary Crisis

The planar passive compass-gait biped robot on sloped surfaces is the simplest model of legged walkers. It is a two-degrees-of-freedom impulsive mechanical system known to exhibit, in response to an increase in the slope angle of the walking surface, a sequence of period-doubling bifurcations leading to chaos before falling down at some critical slope without any explanation. The fall is found ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Robotics and Autonomous Systems

دوره 54  شماره 

صفحات  -

تاریخ انتشار 2006